Emotional Speech Corpus Creation, Structure, Distribution and Re-Use
نویسندگان
چکیده
This paper details the on-going creation of a natural emotional speech corpus, its structure, distribution, and re-use. Using Mood Induction Procedures (MIPs), high quality emotional speech assets are obtained, analysed, tagged (for acoustic features), annotated and uploaded to an online speech corpus. This method structures the corpus in a logical and coherent manner, allowing it to be utilized for more than one purpose, ensuring distribution via a URL and ease of access through a web browser. This i s vital to ensuring the reusability of the corpus by third party’s and third party applications.
منابع مشابه
Creation and utilisation of the MediaTeam Emotional Speech Corpus
The MediaTeam Emotional Speech Corpus is currently the largest database of emotional speech for colloquial modern Finnish, containing simulated emotional content. The specific aim of the research is to investigate in detail the phonetic and phonological/linguistic correlates of basic or primary emotions in spoken Finnish, to develop statistical classification methods of emotional speech signals...
متن کاملEmoVoice - A Framework for Online Recognition of Emotions from Voice
We present EmoVoice, a framework for emotional speech corpus and classifier creation and for offline as well as real-time online speech emotion recognition. The framework is intended to be used by non-experts and therefore comes with an interface to create an own personal or application specific emotion recogniser. Furthermore, we describe some applications and prototypes that already use our f...
متن کاملDesigning and Recording an Emotional Speech Database for Corpus Based Synthesis in Basque
This paper describes an emotional speech database recorded for standard Basque. The database has been designed with the twofold purpose of being used for corpus based synthesis, and also of allowing the study of prosodic models for the emotions. The database is thus large, to get good corpus based synthesis quality and contains the same texts recorded in the six basic emotions plus the neutral ...
متن کاملDesigning the Latvian Speech Recognition Corpus
In this paper the authors present the first Latvian speech corpus designed specifically for speech recognition purposes. The paper outlines the decisions made in the corpus designing process through analysis of related work on speech corpora creation for different languages. The authors provide also guidelines that were used for the creation of the Latvian speech recognition corpus. The corpus ...
متن کاملrre STC-TIMIT: Generation of a Single-channel Telephone Corpus
This paper describes a new speech corpus, STC-TIMIT, and discusses the process of design, development and its distribution through LDC. The STC-TIMIT corpus is derived from the widely used TIMIT corpus by sending it through a real and single telephone channel. TIMIT is phonetically balanced, covers the dialectal diversity in continental USA and has been extensively used as a benchmark for speec...
متن کامل